Overview
Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 90767 |
| Missing cells | 13714 |
| Missing cells (%) | 0.7% |
| Total size in memory | 15.2 MiB |
| Average record size in memory | 176.0 B |
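The percentage and per-record figures in this table follow from simple arithmetic on the reported counts. A minimal sketch, using only the numbers shown above (the variable names are illustrative, not part of the report):

```python
# Figures copied from the dataset statistics table.
n_variables = 22
n_observations = 90767
missing_cells = 13714

# Share of missing cells across the whole table (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Memory per record implied by the reported total, which is itself rounded to 15.2 MiB.
total_bytes = 15.2 * 1024 ** 2
bytes_per_record = total_bytes / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"approx. bytes per record: {bytes_per_record:.1f}")
```

Because the total size is rounded to one decimal place, the per-record value computed this way only approximates the reported 176.0 B.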
Variable types
| Numeric | 11 |
|---|---|
| Text | 11 |
Alerts
| merch_zipcode has 13714 (15.1%) missing values | Missing |
|---|---|
| amt is highly skewed (γ1 = 26.29357902) | Skewed |
| trans_num has unique values | Unique |
| is_fraud has 90242 (99.4%) zeros | Zeros |
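The zero count for is_fraud implies a strong class imbalance. The sketch below derives the implied positive rate from the reported counts, assuming is_fraud is a binary 0/1 flag so that every non-zero row is a fraud case (the report does not state this explicitly):

```python
# Counts copied from the alerts and the dataset statistics.
n_observations = 90767
zeros_in_is_fraud = 90242
missing_merch_zipcode = 13714

# Assumption: is_fraud only takes the values 0 and 1.
fraud_rows = n_observations - zeros_in_is_fraud
fraud_rate_pct = 100 * fraud_rows / n_observations

# Missing share for merch_zipcode, as quoted in the alert.
missing_zip_pct = 100 * missing_merch_zipcode / n_observations

print(f"fraud rows: {fraud_rows} ({fraud_rate_pct:.2f}%)")
print(f"merch_zipcode missing: {missing_zip_pct:.1f}%")
```

The merch_zipcode count equals the dataset-wide missing-cell total, so that column accounts for every missing cell in the data.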
Reproduction
| Analysis started | 2025-06-23 17:29:36.620836 |
|---|---|
| Analysis finished | 2025-06-23 17:29:37.623140 |
| Duration | 1 second |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
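A report like this one can be regenerated with ydata-profiling. The sketch below is only an illustration: the function name, the file paths, and the report title are hypothetical, and the configuration used for the original run is not known from this page.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write an HTML report (paths are hypothetical)."""
    # Imported inside the function so the sketch can be loaded
    # even where pandas and ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Transaction data profile")
    profile.to_file(html_path)
```

A call such as `build_report("transactions.csv", "report.html")` would write the report next to the input file.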
Variables
cc_num
Real number (ℝ)
| Distinct | 949 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.217238762 × 10^17 |
| Minimum | 6.041620718 × 10^10 |
|---|---|
| Maximum | 4.992346398 × 10^18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 6.041620718 × 10^10 |
|---|---|
| 5-th percentile | 6.304848798 × 10^11 |
| Q1 | 1.800400275 × 10^14 |
| median | 3.520550088 × 10^15 |
| Q3 | 4.651007078 × 10^15 |
| 95-th percentile | 4.502539527 × 10^18 |
| Maximum | 4.992346398 × 10^18 |
| Range | 4.992346338 × 10^18 |
| Interquartile range (IQR) | 4.47096705 × 10^15 |
Descriptive statistics
| Standard deviation | 1.315095899 × 10^18 |
|---|---|
| Coefficient of variation (CV) | 3.118381418 |
| Kurtosis | 6.062046089 |
| Mean | 4.217238762 × 10^17 |
| Median Absolute Deviation (MAD) | 3.140652844 × 10^15 |
| Skewness | 2.83104677 |
| Sum | 1.617115973 × 10^18 |
| Variance | 1.729477223 × 10^36 |
| Monotonicity | Not monotonic |
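The Sum row for cc_num does not agree with the Mean row multiplied by the number of observations. The sketch below makes the mismatch explicit, reading the mean as 4.217238762 × 10^17 and the sum as 1.617115973 × 10^18. That the discrepancy comes from 64-bit integer wraparound is an assumption; the report itself gives no explanation.

```python
# Values copied from the cc_num tables; n_observations from the overview.
n_observations = 90767
mean_cc_num = 4.217238762e17
reported_sum = 1.617115973e18

# Largest value a signed 64-bit integer can hold.
int64_max = 2 ** 63 - 1

# Sum implied by the mean and the row count.
expected_sum = mean_cc_num * n_observations
exceeds_int64 = expected_sum > int64_max

print(f"expected sum ~ {expected_sum:.3e}")
print(f"reported sum = {reported_sum:.3e}")
print(f"implied sum exceeds int64 range: {exceeds_int64}")
```

Since cc_num is an identifier rather than a quantity, its sum, mean, and variance carry little meaning either way; reading the column as text is a common alternative.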
| Value | Count | Frequency (%) |
| 6.011893665 × 10^15 | 249 | 0.3% |
| 3.725092582 × 10^14 | 246 | 0.3% |
| 4.716561797 × 10^15 | 241 | 0.3% |
| 3.02387559 × 10^13 | 240 | 0.3% |
| 6.011109737 × 10^15 | 235 | 0.3% |
| 4.364010865 × 10^15 | 233 | 0.3% |
| 6.534628261 × 10^15 | 233 | 0.3% |
| 6.011438889 × 10^15 | 232 | 0.3% |
| 3.764452668 × 10^14 | 232 | 0.3% |
| 3.521417321 × 10^15 | 231 | 0.3% |
| Other values (939) | 88395 |
| Value | Count | Frequency (%) |
| 6.041620718 × 10^10 | 100 | |
| 6.042292873 × 10^10 | 96 | |
| 6.042309813 × 10^10 | 50 | |
| 6.042785159 × 10^10 | 41 | |
| 6.048700208 × 10^10 | 36 | < 0.1% |
| Value | Count | Frequency (%) |
| 4.992346398 × 10^18 | 156 | |
| 4.989847571 × 10^18 | 71 | |
| 4.980323468 × 10^18 | 44 | < 0.1% |
| 4.973530368 × 10^18 | 79 | |
| 4.958589672 × 10^18 | 113 | |
merchant
Text
| Distinct | 693 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 43 |
|---|---|
| Median length | 36 |
| Mean length | 23.10297795 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | fraud_Kerluke Inc |
|---|---|
| 2nd row | fraud_Rempel PLC |
| 3rd row | fraud_Rodriguez Group |
| 4th row | fraud_Doyle Ltd |
| 5th row | fraud_Leffler-Goldner |
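Every sample row above starts with the literal prefix `fraud_`. A minimal sketch of stripping it, using the five sample values; that all other rows carry the same prefix is an assumption, and `str.removeprefix` requires Python 3.9 or later:

```python
# Sample merchant names copied from the table above.
samples = [
    "fraud_Kerluke Inc",
    "fraud_Rempel PLC",
    "fraud_Rodriguez Group",
    "fraud_Doyle Ltd",
    "fraud_Leffler-Goldner",
]

# removeprefix leaves a string unchanged when the prefix is absent,
# so rows without "fraud_" would pass through untouched.
cleaned = [name.removeprefix("fraud_") for name in samples]

for original, short in zip(samples, cleaned):
    print(f"{original!r} -> {short!r}")
```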
| Value | Count | Frequency (%) |
| and | 33057 | 15.6% |
| llc | 6829 | 3.2% |
| inc | 6450 | 3.1% |
| sons | 5124 | 2.4% |
| ltd | 5043 | 2.4% |
| plc | 4625 | 2.2% |
| group | 3512 | 1.7% |
| fraud_kutch | 698 | 0.3% |
| fraud_schaefer | 679 | 0.3% |
| fraud_streich | 676 | 0.3% |
| Other values (804) | 144580 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 203486 | 9.7% |
| r | 188491 | 9.0% |
| d | 149640 | 7.1% |
| e | 130340 | 6.2% |
| u | 129837 | 6.2% |
| n | 123982 | 5.9% |
| (space) | 120506 | 5.7% |
| f | 97747 | 4.7% |
| _ | 90767 | 4.3% |
| o | 79079 | 3.8% |
| Other values (45) | 783113 |
category
Text
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 10.51940683 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | misc_net |
|---|---|
| 2nd row | grocery_net |
| 3rd row | gas_transport |
| 4th row | grocery_pos |
| 5th row | personal_care |
| Value | Count | Frequency (%) |
| gas_transport | 9236 | 10.2% |
| home | 8710 | 9.6% |
| grocery_pos | 8542 | 9.4% |
| shopping_pos | 8057 | 8.9% |
| kids_pets | 7905 | 8.7% |
| shopping_net | 6714 | 7.4% |
| entertainment | 6602 | 7.3% |
| personal_care | 6450 | 7.1% |
| food_dining | 6417 | 7.1% |
| health_fitness | 6044 | 6.7% |
| Other values (4) | 16090 | 17.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 99729 | 10.4% |
| e | 90443 | 9.5% |
| o | 85904 | 9.0% |
| n | 83545 | 8.7% |
| t | 75538 | 7.9% |
| p | 75269 | 7.9% |
| _ | 72592 | 7.6% |
| r | 64371 | 6.7% |
| i | 58158 | 6.1% |
| a | 46881 | 4.9% |
| Other values (10) | 202385 |
amt
Real number (ℝ)
Skewed 
| Distinct | 20055 |
|---|---|
| Distinct (%) | 22.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.93903577 |
| Minimum | 1 |
|---|---|
| Maximum | 13536.84 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.42 |
| Q1 | 9.63 |
| median | 47.28 |
| Q3 | 82.59 |
| 95-th percentile | 193.945 |
| Maximum | 13536.84 |
| Range | 13535.84 |
| Interquartile range (IQR) | 72.96 |
Descriptive statistics
| Standard deviation | 153.3540508 |
|---|---|
| Coefficient of variation (CV) | 2.192681799 |
| Kurtosis | 1427.651904 |
| Mean | 69.93903577 |
| Median Absolute Deviation (MAD) | 37.26 |
| Skewness | 26.29357902 |
| Sum | 6348156.46 |
| Variance | 23517.46488 |
| Monotonicity | Not monotonic |
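Several rows in the amt tables are derived from others: range from minimum and maximum, IQR from the quartiles, CV and variance from the standard deviation, and the sum from the mean. A short consistency check using only the reported figures (variable names are illustrative):

```python
# Values copied from the amt tables; n_observations from the overview.
n_observations = 90767
mean = 69.93903577
std = 153.3540508
minimum, maximum = 1.0, 13536.84
q1, q3 = 9.63, 82.59

value_range = maximum - minimum   # reported: 13535.84
iqr = q3 - q1                     # reported: 72.96
cv = std / mean                   # reported: 2.192681799
variance = std ** 2               # reported: 23517.46488
total = mean * n_observations     # reported: 6348156.46

print(f"range={value_range:.2f} iqr={iqr:.2f} cv={cv:.6f}")
print(f"variance={variance:.3f} sum={total:.2f}")
```

All five derived values agree with the report to within rounding of the displayed digits.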
| Value | Count | Frequency (%) |
| 1.7 | 56 | 0.1% |
| 1.25 | 55 | 0.1% |
| 1.54 | 48 | 0.1% |
| 1.64 | 47 | 0.1% |
| 2.43 | 46 | 0.1% |
| 1.27 | 46 | 0.1% |
| 1.09 | 46 | 0.1% |
| 1.24 | 43 | < 0.1% |
| 1.18 | 42 | < 0.1% |
| 1.4 | 42 | < 0.1% |
| Other values (20045) | 90296 |
| Value | Count | Frequency (%) |
| 1 | 14 | < 0.1% |
| 1.01 | 37 | |
| 1.02 | 31 | |
| 1.03 | 35 | |
| 1.04 | 38 |
| Value | Count | Frequency (%) |
| 13536.84 | 1 | |
| 9999.39 | 1 | |
| 8981.87 | 1 | |
| 8221.84 | 1 | |
| 8217.23 | 1 |
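With a skewness above 26 and a maximum far beyond the 95th percentile, amt is a typical candidate for a log transform before modelling. The sketch below applies `log1p` to the reported quantiles only, to show how the scale compresses; whether such a transform suits a given model is a separate decision that the report does not address.

```python
import math

# Quantiles copied from the amt quantile statistics.
quantiles = {
    "min": 1.0,
    "q1": 9.63,
    "median": 47.28,
    "q3": 82.59,
    "p95": 193.945,
    "max": 13536.84,
}

# log1p(x) = log(1 + x); it is monotonic, so the ordering of values is preserved.
logged = {name: math.log1p(value) for name, value in quantiles.items()}

# Ratio of maximum to median, before and after the transform.
raw_ratio = quantiles["max"] / quantiles["median"]
log_ratio = logged["max"] / logged["median"]

for name in quantiles:
    print(f"{name:>6}: raw={quantiles[name]:>10.3f} log1p={logged[name]:.3f}")
print(f"max/median: raw={raw_ratio:.1f}, log1p={log_ratio:.2f}")
```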
first
Text
| Distinct | 347 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 6.077329867 |
| Min length | 3 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Melody |
|---|---|
| 2nd row | Michael |
| 3rd row | Katelyn |
| 4th row | David |
| 5th row | Emily |
| Value | Count | Frequency (%) |
| christopher | 1867 | 2.1% |
| jessica | 1467 | 1.6% |
| robert | 1451 | 1.6% |
| david | 1401 | 1.5% |
| michael | 1376 | 1.5% |
| james | 1352 | 1.5% |
| john | 1166 | 1.3% |
| jennifer | 1143 | 1.3% |
| mary | 1133 | 1.2% |
| william | 1118 | 1.2% |
| Other values (337) | 77293 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 70063 | 12.7% |
| e | 60389 | 10.9% |
| n | 43345 | 7.9% |
| i | 43227 | 7.8% |
| r | 42213 | 7.7% |
| l | 26939 | 4.9% |
| h | 23991 | 4.3% |
| s | 22758 | 4.1% |
| t | 21674 | 3.9% |
| o | 18961 | 3.4% |
| Other values (39) | 178061 |
last
Text
| Distinct | 474 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.107825531 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Thompson |
|---|---|
| 2nd row | Johnson |
| 3rd row | Wise |
| 4th row | Everett |
| 5th row | Hall |
| Value | Count | Frequency (%) |
| smith | 2010 | 2.2% |
| williams | 1611 | 1.8% |
| davis | 1528 | 1.7% |
| johnson | 1408 | 1.6% |
| rodriguez | 1184 | 1.3% |
| martinez | 1085 | 1.2% |
| jones | 970 | 1.1% |
| lewis | 877 | 1.0% |
| gonzalez | 877 | 1.0% |
| martin | 808 | 0.9% |
| Other values (464) | 78409 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 54832 | 9.9% |
| r | 46276 | 8.3% |
| a | 45735 | 8.2% |
| n | 42791 | 7.7% |
| o | 40707 | 7.3% |
| s | 33969 | 6.1% |
| l | 33894 | 6.1% |
| i | 30378 | 5.5% |
| t | 20377 | 3.7% |
| h | 15909 | 2.9% |
| Other values (38) | 189521 |
gender
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | M |
| 3rd row | F |
| 4th row | M |
| 5th row | F |
| Value | Count | Frequency (%) |
| f | 49491 | 54.5% |
| m | 41276 | 45.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 49491 | 54.5% |
| M | 41276 | 45.5% |
street
Text
| Distinct | 949 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 22.21335948 |
| Min length | 12 |
Unique
| Unique | 26 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0362 Anderson Wall |
|---|---|
| 2nd row | 094 Owens Underpass |
| 3rd row | 674 Maureen Summit Apt. 276 |
| 4th row | 4138 David Fall |
| 5th row | 8851 Reese Neck |
| Value | Count | Frequency (%) |
| apt | 23051 | 6.4% |
| suite | 21237 | 5.9% |
| island | 1612 | 0.4% |
| michael | 1319 | 0.4% |
| common | 1246 | 0.3% |
| station | 1234 | 0.3% |
| islands | 1205 | 0.3% |
| fields | 1196 | 0.3% |
| brooks | 1178 | 0.3% |
| david | 1168 | 0.3% |
| Other values (1889) | 306431 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 270110 | 13.4% |
| e | 125698 | 6.2% |
| a | 101371 | 5.0% |
| i | 90327 | 4.5% |
| t | 86943 | 4.3% |
| r | 76902 | 3.8% |
| n | 74611 | 3.7% |
| s | 72408 | 3.6% |
| l | 62403 | 3.1% |
| o | 61074 | 3.0% |
| Other values (52) | 994393 |
city
Text
| Distinct | 870 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 8.653177917 |
| Min length | 3 |
Unique
| Unique | 22 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mound City |
|---|---|
| 2nd row | Norwalk |
| 3rd row | Scotts Mills |
| 4th row | Morrisdale |
| 5th row | Basye |
| Value | Count | Frequency (%) |
| city | 1496 | 1.3% |
| west | 1381 | 1.2% |
| saint | 1006 | 0.9% |
| north | 988 | 0.9% |
| falls | 913 | 0.8% |
| mount | 808 | 0.7% |
| new | 794 | 0.7% |
| lake | 769 | 0.7% |
| san | 712 | 0.6% |
| springs | 607 | 0.5% |
| Other values (898) | 103526 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 76376 | 9.7% |
| a | 65037 | 8.3% |
| n | 57989 | 7.4% |
| o | 57205 | 7.3% |
| l | 54773 | 7.0% |
| r | 52418 | 6.7% |
| i | 49399 | 6.3% |
| t | 42305 | 5.4% |
| s | 31107 | 4.0% |
| (space) | 22233 | 2.8% |
| Other values (42) | 276581 |
state
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MO |
|---|---|
| 2nd row | CA |
| 3rd row | OR |
| 4th row | PA |
| 5th row | VA |
| Value | Count | Frequency (%) |
| tx | 6531 | 7.2% |
| ny | 5819 | 6.4% |
| pa | 5693 | 6.3% |
| ca | 3961 | 4.4% |
| oh | 3350 | 3.7% |
| mi | 3264 | 3.6% |
| al | 2993 | 3.3% |
| il | 2950 | 3.3% |
| fl | 2923 | 3.2% |
| mo | 2637 | 2.9% |
| Other values (41) | 50646 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 25156 | 13.9% |
| N | 19991 | 11.0% |
| M | 15517 | 8.5% |
| I | 12625 | 7.0% |
| T | 10758 | 5.9% |
| L | 10342 | 5.7% |
| O | 10162 | 5.6% |
| C | 9770 | 5.4% |
| Y | 9049 | 5.0% |
| X | 6531 | 3.6% |
| Other values (14) | 51633 |
zip
Real number (ℝ)
| Distinct | 938 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48727.78369 |
| Minimum | 1257 |
|---|---|
| Maximum | 99783 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 1257 |
|---|---|
| 5-th percentile | 7208 |
| Q1 | 26041 |
| median | 48088 |
| Q3 | 72042 |
| 95-th percentile | 94619 |
| Maximum | 99783 |
| Range | 98526 |
| Interquartile range (IQR) | 46001 |
Descriptive statistics
| Standard deviation | 26940.08393 |
|---|---|
| Coefficient of variation (CV) | 0.5528690593 |
| Kurtosis | -1.096946247 |
| Mean | 48727.78369 |
| Median Absolute Deviation (MAD) | 23105 |
| Skewness | 0.08430307804 |
| Sum | 4422874742 |
| Variance | 725768122.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48088 | 276 | 0.3% |
| 34112 | 250 | 0.3% |
| 80120 | 249 | 0.3% |
| 48438 | 246 | 0.3% |
| 73754 | 241 | 0.3% |
| 59448 | 241 | 0.3% |
| 76578 | 240 | 0.3% |
| 28405 | 235 | 0.3% |
| 5461 | 233 | 0.3% |
| 89512 | 233 | 0.3% |
| Other values (928) | 88323 |
| Value | Count | Frequency (%) |
| 1257 | 140 | |
| 1330 | 69 | |
| 1535 | 33 | < 0.1% |
| 1545 | 60 | |
| 1612 | 43 | < 0.1% |
| Value | Count | Frequency (%) |
| 99783 | 116 | |
| 99747 | 1 | < 0.1% |
| 99746 | 33 | < 0.1% |
| 99323 | 191 | |
| 99160 | 228 |
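The zip column was profiled as a real number, so values such as 1257 and 5461 have fewer than five digits. Assuming these are US five-digit ZIP codes whose leading zeros were dropped when the column was read as numeric (an assumption; the report does not say), they can be restored by zero-padding:

```python
# A few zip values copied from the tables above.
zip_values = [1257, 5461, 48088, 99783]

# Format each value as a five-character, zero-padded string.
zip_codes = [f"{value:05d}" for value in zip_values]

for value, code in zip(zip_values, zip_codes):
    print(f"{value:>5} -> {code}")
```

Keeping the column as text also avoids statistics such as mean and variance, which have no meaning for postal codes.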
lat
Real number (ℝ)
| Distinct | 936 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.57377767 |
| Minimum | 20.0271 |
|---|---|
| Maximum | 66.6933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 20.0271 |
|---|---|
| 5-th percentile | 29.8872 |
| Q1 | 34.6689 |
| median | 39.4055 |
| Q3 | 42.0144 |
| 95-th percentile | 45.8433 |
| Maximum | 66.6933 |
| Range | 46.6662 |
| Interquartile range (IQR) | 7.3455 |
Descriptive statistics
| Standard deviation | 5.070830013 |
|---|---|
| Coefficient of variation (CV) | 0.1314579571 |
| Kurtosis | 0.8212984229 |
| Mean | 38.57377767 |
| Median Absolute Deviation (MAD) | 3.3811 |
| Skewness | -0.1871314491 |
| Sum | 3501226.078 |
| Variance | 25.71331702 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 42.5164 | 276 | 0.3% |
| 26.1184 | 250 | 0.3% |
| 39.5994 | 249 | 0.3% |
| 42.9147 | 246 | 0.3% |
| 36.385 | 241 | 0.3% |
| 48.2777 | 241 | 0.3% |
| 30.592 | 240 | 0.3% |
| 34.2651 | 235 | 0.3% |
| 44.3346 | 233 | 0.3% |
| 39.5483 | 233 | 0.3% |
| Other values (926) | 88323 |
| Value | Count | Frequency (%) |
| 20.0271 | 106 | |
| 20.0827 | 75 | 0.1% |
| 24.6557 | 171 | |
| 26.1184 | 250 | |
| 26.3304 | 29 | < 0.1% |
| Value | Count | Frequency (%) |
| 66.6933 | 1 | < 0.1% |
| 65.6899 | 33 | < 0.1% |
| 64.7556 | 116 | |
| 48.8878 | 228 | |
| 48.8856 | 150 |
long
Real number (ℝ)
| Distinct | 937 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.23810249 |
| Minimum | -165.6723 |
|---|---|
| Maximum | -67.9503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 90767 |
| Negative (%) | 100.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | -165.6723 |
|---|---|
| 5-th percentile | -119.7957 |
| Q1 | -96.798 |
| median | -87.4581 |
| Q3 | -80.1284 |
| 95-th percentile | -73.5112 |
| Maximum | -67.9503 |
| Range | 97.722 |
| Interquartile range (IQR) | 16.6696 |
Descriptive statistics
| Standard deviation | 13.80436391 |
|---|---|
| Coefficient of variation (CV) | -0.1529771075 |
| Kurtosis | 1.859241866 |
| Mean | -90.23810249 |
| Median Absolute Deviation (MAD) | 8.1464 |
| Skewness | -1.153925254 |
| Sum | -8190641.849 |
| Variance | 190.5604629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -82.9832 | 276 | 0.3% |
| -81.7361 | 250 | 0.3% |
| -105.0044 | 249 | 0.3% |
| -83.4845 | 246 | 0.3% |
| -112.8456 | 241 | 0.3% |
| -98.0727 | 241 | 0.3% |
| -97.2893 | 240 | 0.3% |
| -77.867 | 235 | 0.3% |
| -73.098 | 233 | 0.3% |
| -119.7957 | 233 | 0.3% |
| Other values (927) | 88323 |
| Value | Count | Frequency (%) |
| -165.6723 | 116 | |
| -156.292 | 33 | < 0.1% |
| -155.488 | 75 | |
| -155.3697 | 106 | |
| -153.994 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -67.9503 | 152 | |
| -68.5565 | 69 | |
| -69.2675 | 32 | < 0.1% |
| -69.4828 | 136 | |
| -69.9576 | 43 | < 0.1% |
city_pop
Real number (ℝ)
| Distinct | 856 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88019.77275 |
| Minimum | 23 |
|---|---|
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 139 |
| Q1 | 743 |
| median | 2456 |
| Q3 | 20328 |
| 95-th percentile | 525713 |
| Maximum | 2906700 |
| Range | 2906677 |
| Interquartile range (IQR) | 19585 |
Descriptive statistics
| Standard deviation | 298024.6363 |
|---|---|
| Coefficient of variation (CV) | 3.385882819 |
| Kurtosis | 37.62957174 |
| Mean | 88019.77275 |
| Median Absolute Deviation (MAD) | 2198 |
| Skewness | 5.581723772 |
| Sum | 7989290713 |
| Variance | 8.881868384 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1312922 | 388 | 0.4% |
| 606 | 381 | 0.4% |
| 1766 | 353 | 0.4% |
| 1595797 | 352 | 0.4% |
| 241 | 319 | 0.4% |
| 198 | 315 | 0.3% |
| 302 | 309 | 0.3% |
| 910148 | 306 | 0.3% |
| 2135 | 286 | 0.3% |
| 276002 | 279 | 0.3% |
| Other values (846) | 87479 |
| Value | Count | Frequency (%) |
| 23 | 132 | |
| 37 | 73 | 0.1% |
| 43 | 150 | |
| 46 | 203 | |
| 47 | 43 | < 0.1% |
| Value | Count | Frequency (%) |
| 2906700 | 268 | |
| 2504700 | 143 | |
| 2383912 | 36 | < 0.1% |
| 1595797 | 352 | |
| 1577385 | 160 |
job
Text
| Distinct | 486 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 59 |
|---|---|
| Median length | 38 |
| Mean length | 20.22102747 |
| Min length | 3 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Architect |
|---|---|
| 2nd row | Firefighter |
| 3rd row | Engineer, petroleum |
| 4th row | Advice worker |
| 5th row | Engineer, mining |
| Value | Count | Frequency (%) |
| engineer | 9169 | 4.5% |
| officer | 7666 | 3.8% |
| manager | 4311 | 2.1% |
| scientist | 3900 | 1.9% |
| designer | 3550 | 1.8% |
| surveyor | 3344 | 1.7% |
| teacher | 2685 | 1.3% |
| psychologist | 2332 | 1.2% |
| research | 2013 | 1.0% |
| editor | 1995 | 1.0% |
| Other values (452) | 160616 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 196159 | 10.7% |
| i | 167019 | 9.1% |
| r | 153990 | 8.4% |
| a | 127805 | 7.0% |
| t | 124373 | 6.8% |
| n | 123357 | 6.7% |
| (space) | 110814 | 6.0% |
| o | 104130 | 5.7% |
| s | 101392 | 5.5% |
| c | 92566 | 5.0% |
| Other values (43) | 533797 |
trans_num
Text
Unique 
| Distinct | 90767 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Unique
| Unique | 90767 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3d21bce7967838c3988cfe0f7fca878a |
|---|---|
| 2nd row | fda7712b4bbcaab36afded37ab55047f |
| 3rd row | 59161e0002642934974c1ae98bfa1f55 |
| 4th row | f487a7098c0bd4d45f710be1745c4acb |
| 5th row | c2ed76f03cce8a6b362729a5a23f01c2 |
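The length table and the samples suggest that trans_num is a fixed-width, 32-character lowercase hexadecimal string. A minimal format check on the five sample rows; the pattern is inferred from the samples and is not documented in the report:

```python
import re

# Assumed format: exactly 32 lowercase hexadecimal characters.
HEX32 = re.compile(r"[0-9a-f]{32}")

# Sample values copied from the table above.
samples = [
    "3d21bce7967838c3988cfe0f7fca878a",
    "fda7712b4bbcaab36afded37ab55047f",
    "59161e0002642934974c1ae98bfa1f55",
    "f487a7098c0bd4d45f710be1745c4acb",
    "c2ed76f03cce8a6b362729a5a23f01c2",
]

all_valid = all(HEX32.fullmatch(value) is not None for value in samples)
rejects_other = HEX32.fullmatch("not-a-transaction-id") is None

print(f"all samples match: {all_valid}")
print(f"malformed value rejected: {rejects_other}")
```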
| Value | Count | Frequency (%) |
| 3d21bce7967838c3988cfe0f7fca878a | 1 | < 0.1% |
| d047231d08ba22e60b6ff3b9fc0a50db | 1 | < 0.1% |
| c2ed76f03cce8a6b362729a5a23f01c2 | 1 | < 0.1% |
| 08afca2c21a05c8dfabfc2564d88ced6 | 1 | < 0.1% |
| e0102a704a1c61b7a64d01dd72a2993d | 1 | < 0.1% |
| dc1fa86aba755a0f50097699fe6d44e9 | 1 | < 0.1% |
| 566250df239b92a1250d5b0ace559bf0 | 1 | < 0.1% |
| ac2cd71d98e6cd1f3c98894977d28614 | 1 | < 0.1% |
| c0bb681843a1c1b60739b909ad2955eb | 1 | < 0.1% |
| f47012152799267e90676dfdeb2641da | 1 | < 0.1% |
| Other values (90757) | 90757 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 182302 | 6.3% |
| 4 | 182129 | 6.3% |
| 9 | 182090 | 6.3% |
| e | 182023 | 6.3% |
| 0 | 181972 | 6.3% |
| b | 181802 | 6.3% |
| a | 181494 | 6.2% |
| f | 181432 | 6.2% |
| 1 | 181406 | 6.2% |
| d | 181396 | 6.2% |
| Other values (6) | 1086498 |
merch_lat
Real number (ℝ)
| Distinct | 90516 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.57366865 |
| Minimum | 19.033288 |
| Maximum | 66.624674 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 19.033288 |
|---|---|
| 5-th percentile | 29.7800809 |
| Q1 | 34.7524385 |
| median | 39.415483 |
| Q3 | 41.9808155 |
| 95-th percentile | 46.037913 |
| Maximum | 66.624674 |
| Range | 47.591386 |
| Interquartile range (IQR) | 7.228377 |
Descriptive statistics
| Standard deviation | 5.102019892 |
|---|---|
| Coefficient of variation (CV) | 0.1322669082 |
| Kurtosis | 0.8129226031 |
| Mean | 38.57366865 |
| Median Absolute Deviation (MAD) | 3.383273 |
| Skewness | -0.1818535362 |
| Sum | 3501216.183 |
| Variance | 26.03060698 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 39.531126 | 3 | < 0.1% |
| 33.600014 | 3 | < 0.1% |
| 34.484633 | 2 | < 0.1% |
| 35.866069 | 2 | < 0.1% |
| 41.491069 | 2 | < 0.1% |
| 43.108958 | 2 | < 0.1% |
| 39.162349 | 2 | < 0.1% |
| 39.451583 | 2 | < 0.1% |
| 38.663229 | 2 | < 0.1% |
| 38.305552 | 2 | < 0.1% |
| Other values (90506) | 90745 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 19.033288 | 1 | < 0.1% |
| 19.036618 | 1 | < 0.1% |
| 19.04188 | 1 | < 0.1% |
| 19.063792 | 1 | < 0.1% |
| 19.095712 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 66.624674 | 1 | < 0.1% |
| 66.554249 | 1 | < 0.1% |
| 66.514576 | 1 | < 0.1% |
| 66.454209 | 1 | < 0.1% |
| 66.436224 | 1 | < 0.1% |
merch_long
Real number (ℝ)
| Distinct | 90682 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.23894977 |
| Minimum | -166.661968 |
| Maximum | -66.962913 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 90767 |
| Negative (%) | 100.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | -166.661968 |
|---|---|
| 5-th percentile | -119.4410536 |
| Q1 | -96.882401 |
| median | -87.413904 |
| Q3 | -80.2009085 |
| 95-th percentile | -73.3614921 |
| Maximum | -66.962913 |
| Range | 99.699055 |
| Interquartile range (IQR) | 16.6814925 |
Descriptive statistics
| Standard deviation | 13.8168318 |
|---|---|
| Coefficient of variation (CV) | -0.1531138365 |
| Kurtosis | 1.848091919 |
| Mean | -90.23894977 |
| Median Absolute Deviation (MAD) | 8.256166 |
| Skewness | -1.150196567 |
| Sum | -8190718.754 |
| Variance | 190.904841 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -84.045893 | 2 | < 0.1% |
| -96.477598 | 2 | < 0.1% |
| -80.173837 | 2 | < 0.1% |
| -85.324174 | 2 | < 0.1% |
| -83.964422 | 2 | < 0.1% |
| -97.536537 | 2 | < 0.1% |
| -80.277138 | 2 | < 0.1% |
| -74.54113 | 2 | < 0.1% |
| -77.292665 | 2 | < 0.1% |
| -86.260893 | 2 | < 0.1% |
| Other values (90672) | 90747 | > 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -166.661968 | 1 | < 0.1% |
| -166.65656 | 1 | < 0.1% |
| -166.654993 | 1 | < 0.1% |
| -166.651656 | 1 | < 0.1% |
| -166.628201 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| -66.962913 | 1 | < 0.1% |
| -66.988551 | 1 | < 0.1% |
| -66.996552 | 1 | < 0.1% |
| -66.99675 | 1 | < 0.1% |
| -67.010222 | 1 | < 0.1% |
is_fraud
Real number (ℝ)
Zeros 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.005784040455 |
| Minimum | 0 |
| Maximum | 1 |
| Zeros | 90242 |
| Zeros (%) | 99.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.07583303164 |
|---|---|
| Coefficient of variation (CV) | 13.11073673 |
| Kurtosis | 167.9046567 |
| Mean | 0.005784040455 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.03460613 |
| Sum | 525 |
| Variance | 0.005750648687 |
| Monotonicity | Not monotonic |
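Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 90242 | 99.4% |
| 1 | 525 | 0.6% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 90242 | 99.4% |
| 1 | 525 | 0.6% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 525 | 0.6% |
| 0 | 90242 | 99.4% |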
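Because is_fraud only takes the values 0 and 1, its descriptive statistics follow from the two counts in the value table. The sketch below reproduces the mean, variance, standard deviation, and coefficient of variation from those counts. It assumes the report uses the sample variance (n - 1 denominator, the pandas default); the function name `binary_stats` is illustrative and not part of ydata-profiling.

```python
import math


def binary_stats(n_zeros: int, n_ones: int) -> dict:
    """Summary statistics of a 0/1 column computed from its two counts."""
    n = n_zeros + n_ones
    mean = n_ones / n  # share of rows flagged as fraud
    # Sample variance with an n - 1 denominator (an assumption about the report).
    variance = mean * (1 - mean) * n / (n - 1)
    std = math.sqrt(variance)
    return {"n": n, "mean": mean, "variance": variance, "std": std, "cv": std / mean}


stats = binary_stats(n_zeros=90242, n_ones=525)
print(stats)
```

With only 525 positives in 90767 rows, the large skewness and kurtosis reported above reflect the class imbalance rather than any property of a continuous distribution.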
merch_zipcode
Real number (ℝ)
Missing 
| Distinct | 22188 |
|---|---|
| Distinct (%) | 28.8% |
| Missing | 13714 |
| Missing (%) | 15.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46668.58156 |
| Minimum | 1003 |
| Maximum | 99403 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 1003 |
|---|---|
| 5-th percentile | 7659 |
| Q1 | 24916 |
| median | 45694 |
| Q3 | 68061 |
| 95-th percentile | 92632.8 |
| Maximum | 99403 |
| Range | 98400 |
| Interquartile range (IQR) | 43145 |
Descriptive statistics
| Standard deviation | 25843.17083 |
|---|---|
| Coefficient of variation (CV) | 0.5537595094 |
| Kurtosis | -0.9974583605 |
| Mean | 46668.58156 |
| Median Absolute Deviation (MAD) | 21473 |
| Skewness | 0.1542444807 |
| Sum | 3595954215 |
| Variance | 667869478.5 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 34266 | 36 | < 0.1% |
| 21661 | 30 | < 0.1% |
| 16353 | 27 | < 0.1% |
| 33471 | 25 | < 0.1% |
| 16239 | 25 | < 0.1% |
| 44004 | 24 | < 0.1% |
| 43436 | 23 | < 0.1% |
| 79227 | 23 | < 0.1% |
| 47448 | 23 | < 0.1% |
| 33935 | 23 | < 0.1% |
| Other values (22178) | 76794 | 84.6% |
| (Missing) | 13714 | 15.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1003 | 1 | < 0.1% |
| 1005 | 5 | < 0.1% |
| 1007 | 8 | < 0.1% |
| 1008 | 3 | < 0.1% |
| 1011 | 5 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99403 | 2 | < 0.1% |
| 99402 | 2 | < 0.1% |
| 99401 | 2 | < 0.1% |
| 99371 | 7 | < 0.1% |
| 99362 | 4 | < 0.1% |
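The profile treats merch_zipcode as a real number, so the smallest values appear with four digits (1003, 1005, ...), presumably five-digit ZIP codes that lost a leading zero. Numeric summaries such as the mean or variance also carry no geographic meaning for a code. A minimal sketch of restoring five-character strings follows, assuming values arrive as floats with NaN for the missing entries; `format_zip` is a hypothetical helper, not part of the dataset or of ydata-profiling.

```python
import math
from typing import Optional


def format_zip(value: Optional[float]) -> Optional[str]:
    """Return a 5-character ZIP string, or None when the value is missing."""
    if value is None or math.isnan(value):
        return None
    # Zero-pad on the left so 1003.0 becomes "01003".
    return f"{int(value):05d}"


examples = [1003.0, 99403.0, float("nan")]
print([format_zip(v) for v in examples])
```

Storing the result as text would also make the profiler report the column as a categorical or text variable instead of a numeric one.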
datetime
Text
| Distinct | 90667 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 709.2 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 90567 |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | 2019-05-04 11:57:04 |
|---|---|
| 2nd row | 2019-12-14 08:55:21 |
| 3rd row | 2019-03-30 05:21:33 |
| 4th row | 2019-09-19 07:09:46 |
| 5th row | 2019-02-04 20:37:44 |
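The column is profiled as Text, and every value has the same length of 19 characters, which matches the `YYYY-MM-DD HH:MM:SS` pattern of the sample rows. A minimal sketch of parsing such strings into datetime objects follows, assuming every row uses the pattern seen in the sample; `parse_timestamp` and `DATETIME_FORMAT` are illustrative names.

```python
from datetime import datetime

# Format inferred from the sample rows; an assumption for the rest of the column.
DATETIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_timestamp(text: str) -> datetime:
    """Parse one datetime string; raises ValueError if the pattern differs."""
    return datetime.strptime(text, DATETIME_FORMAT)


samples = ["2019-05-04 11:57:04", "2019-12-14 08:55:21", "2019-03-30 05:21:33"]
parsed = [parse_timestamp(s) for s in samples]
print([p.isoformat() for p in parsed])
```

Once parsed, the column can be profiled as a date variable, so the report would summarize a time range instead of word and character frequencies.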
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 2019-12-08 | 488 | 0.3% |
| 2019-12-29 | 450 | 0.2% |
| 2019-12-01 | 448 | 0.2% |
| 2019-12-15 | 447 | 0.2% |
| 2019-12-22 | 441 | 0.2% |
| 2019-12-16 | 434 | 0.2% |
| 2019-12-09 | 428 | 0.2% |
| 2019-12-23 | 422 | 0.2% |
| 2019-12-30 | 421 | 0.2% |
| 2019-12-28 | 420 | 0.2% |
| Other values (55750) | 177135 | 97.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 317148 | 18.4% |
| 2 | 250464 | 14.5% |
| 1 | 239192 | 13.9% |
| - | 181534 | 10.5% |
| : | 181534 | 10.5% |
| 9 | 104273 | 6.0% |
| (space) | 90767 | 5.3% |
| 3 | 84286 | 4.9% |
| 5 | 75187 | 4.4% |
| 4 | 73887 | 4.3% |
| Other values (3) | 126301 | 7.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1724573 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1724573 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1724573 | 100.0% |
age
Real number (ℝ)
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.56302401 |
| Minimum | 13 |
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 709.2 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 32 |
| median | 44 |
| Q3 | 57 |
| 95-th percentile | 79 |
| Maximum | 95 |
| Range | 82 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.40714748 |
|---|---|
| Coefficient of variation (CV) | 0.3820454822 |
| Kurtosis | -0.1791284442 |
| Mean | 45.56302401 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.6036856753 |
| Sum | 4135619 |
| Variance | 303.0087833 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 47 | 2894 | 3.2% |
| 34 | 2596 | 2.9% |
| 35 | 2526 | 2.8% |
| 46 | 2469 | 2.7% |
| 43 | 2395 | 2.6% |
| 44 | 2369 | 2.6% |
| 32 | 2301 | 2.5% |
| 33 | 2273 | 2.5% |
| 31 | 2162 | 2.4% |
| 45 | 2072 | 2.3% |
| Other values (73) | 66710 | 73.5% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 5 | < 0.1% |
| 14 | 294 | 0.3% |
| 15 | 436 | 0.5% |
| 16 | 223 | 0.2% |
| 17 | 170 | 0.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 95 | 20 | < 0.1% |
| 94 | 26 | < 0.1% |
| 93 | 286 | 0.3% |
| 92 | 336 | 0.4% |
| 91 | 369 | 0.4% |